Measuring the mass profile of galaxy clusters beyond 
their virial radius 



Antonaldo Diaferio 

Dipartimento di Fisica Generale "Amedeo Avogadro", Università degli Studi di Torino,
Via P. Giuria 1, I-10125 Torino, Italy

Istituto Nazionale di Fisica Nucleare, Sezione di Torino,
Via P. Giuria 1, I-10125 Torino, Italy



Summary. - Traditional estimators of the mass of galaxy clusters assume that the
cluster components (galaxies, intracluster medium, and dark matter) are in dynamical
equilibrium. Two additional estimators, which do not require this assumption,
were proposed in the 1990s: gravitational lensing and the caustic technique. With
these methods, we can measure the cluster mass within radii much larger than the
virial radius. In the caustic technique, the mass measurement is based only on the
celestial coordinates and redshifts of the galaxies in the cluster field of view; therefore,
unlike lensing, it can be, in principle, applied to clusters at any redshift. Here,
we review the origin, the basics and the performance of the caustic method.



1. Introduction 

In the currently accepted cosmological model, galaxy formation is intimately connected
to the formation of the large-scale cosmic structure. To test this model, we need
to measure the relative distribution of light and matter in the Universe. The mass
distribution on small scales, from galaxies to galaxy clusters, has usually been inferred by
assuming that systems are in dynamical equilibrium. On very large scales, the mass
overdensities are small enough that the linear theory of density perturbations can be
used to measure the mass distribution from the relation between the mass density field
and the peculiar velocities of galaxies [80].

On intermediate, mildly non-linear scales, $\sim 1$-$10\,h^{-1}$ Mpc (^1), neither the dynamical
equilibrium hypothesis nor linear theory is valid. No robust way of measuring the mass
distribution in this regime was available until the 1990s, when both gravitational lensing
and the caustic technique were developed. Here, we provide an overview of how the
caustic technique came about and what it has contributed so far.



2. The context of mass estimators 



2.1. The assumption of dynamical equilibrium. - Galaxy cluster mass estimators measure
either the total mass within a given radius $R$ or the radial mass profile. Traditionally,
both kinds of estimators are based on the assumption that the cluster is spherical and
in dynamical equilibrium.

The virial theorem is usually applied when the number of galaxy redshifts is not large:
the galaxy velocity dispersion $\sigma$ and the cluster size $R$ are sufficient to yield an estimate
of the cluster total mass $M = \sigma^2 R/G$ [82], where $G$ is the gravitational constant. More
accurate measurements require a surface term correction [70, 39], which can decrease the
estimated mass by a substantial factor ($\lesssim 20\%$, on average [28]), and knowledge of the
galaxy orbital distribution. Although this distribution can only be reasonably
guessed in most cases, its uncertainty has only a modest impact ($\sim 5\%$) on the final
mass estimate. These uncertainties become an order of magnitude larger if the galaxies
are not fair tracers of the mass distribution.
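
As a rough numerical illustration of the order-of-magnitude estimate $M = \sigma^2 R/G$ quoted above, the following Python sketch evaluates it for one set of inputs. The function name, the unit choices and the sample values ($\sigma = 1000$ km s$^{-1}$, $R = 1.5$ Mpc) are assumptions made for this illustration only; the surface-term and orbital corrections discussed in the text are not included.

```python
# Order-of-magnitude virial mass, M = sigma^2 R / G (no surface-term correction).
G = 4.30091e-9  # gravitational constant in Mpc (km/s)^2 / M_sun


def virial_mass(sigma_kms, radius_mpc):
    """Return sigma^2 R / G in solar masses for sigma in km/s and R in Mpc."""
    if sigma_kms < 0 or radius_mpc < 0:
        raise ValueError("sigma and R must be non-negative")
    return sigma_kms**2 * radius_mpc / G


# Illustrative values only; they are not taken from any cluster in the text.
example_mass = virial_mass(sigma_kms=1000.0, radius_mpc=1.5)
print(f"M = {example_mass:.2e} M_sun")
```

Because the estimate scales as $\sigma^2$, a 10% error on the velocity dispersion alone translates into roughly a 20% error on the mass.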

When the number of galaxy spectra is large enough that we can estimate the velocity 
dispersion profile, we can apply the Jeans equations for a steady-state spherical system. 
The cumulative mass is 



(1)  $M(<r) = -\frac{\langle v_r^2\rangle\, r}{G}\left[\frac{d\ln\rho_m}{d\ln r} + \frac{d\ln\langle v_r^2\rangle}{d\ln r} + 2\beta(r)\right]$ .

However, as in the virial theorem, the application of equation (1) requires the assumption
of a relation between the galaxy number density profile and the mass density profile $\rho_m$.
Moreover, we do not usually know the velocity anisotropy parameter

(2)  $\beta(r) = 1 - \frac{\langle v_\theta^2\rangle + \langle v_\phi^2\rangle}{2\langle v_r^2\rangle}$ ,

where $v_\theta$, $v_\phi$, and $v_r$ are the longitudinal, azimuthal and radial components of the velocity
$\mathbf{v}$ of the galaxies, respectively, and the brackets indicate an average over the velocities of
the galaxies in the volume $d^3\mathbf{r}$ centered on position $\mathbf{r}$ from the cluster center. Therefore,
we cannot measure $M(<r)$ without guessing $\beta(r)$, or vice versa. A common strategy is
to measure the velocity distributions of different galaxy populations which are assumed
to be in equilibrium and thus to trace the same gravitational potential. This method
can help to break this mass-anisotropy degeneracy, although not completely (see [6, 7]
for very lucid reviews of these methods).

(^1) We use $H_0 = 100\,h$ km s$^{-1}$ Mpc$^{-1}$ throughout.

We can estimate the mass profile when observations in the X-ray band provide the
intracluster medium (ICM) density $\rho_{\rm gas}$ and temperature $T$. The assumption of hydrostatic
equilibrium of the ICM yields a relation similar to equation (1),

(3)  $M(<r) = -\frac{kT\, r}{G\mu m_p}\left[\frac{d\ln\rho_{\rm gas}}{d\ln r} + \frac{d\ln T}{d\ln r}\right]$ ,

where $k$ is the Boltzmann constant, $\mu$ the mean molecular weight, and $m_p$ the proton
mass. Note that the term analogous to $\beta$, which appears in equation (2), is now zero,
because, unlike the galaxy orbits, the ICM pressure is isotropic. When sufficient angular
resolution and energy sensitivity are not available to measure the X-ray spectrum
at different clustrocentric radii and thus estimate the temperature profile, an isothermal
ICM is usually assumed. However, the departure from this assumption appears to
be substantial in most clusters where the density and temperature structures can be
measured (e.g., [38, 19]).
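
To make the role of the temperature gradient in the hydrostatic estimate concrete, the sketch below evaluates $M(<r) = -[kT r/(G\mu m_p)]\,[d\ln\rho_{\rm gas}/d\ln r + d\ln T/d\ln r]$ for given local logarithmic slopes. The function name, the unit conversions, the value $\mu = 0.6$ and the sample slopes are assumptions chosen for illustration; they are not taken from the text.

```python
# Hydrostatic mass for given local logarithmic slopes of gas density and temperature.
G = 4.30091e-9           # gravitational constant in Mpc (km/s)^2 / M_sun
KEV_OVER_MP = 9.57883e4  # (1 keV) / m_p expressed in (km/s)^2


def hydrostatic_mass(r_mpc, kT_kev, dlnrho_dlnr, dlnT_dlnr=0.0, mu=0.6):
    """Return M(<r) in solar masses.

    mu=0.6 is an assumed mean molecular weight, not a value from the text.
    dlnT_dlnr=0 corresponds to the isothermal assumption.
    """
    kT_over_mu_mp = kT_kev * KEV_OVER_MP / mu  # in (km/s)^2
    return -kT_over_mu_mp * r_mpc / G * (dlnrho_dlnr + dlnT_dlnr)


# Hypothetical cluster: kT = 5 keV at r = 1 Mpc with a gas density slope of -2.
isothermal = hydrostatic_mass(r_mpc=1.0, kT_kev=5.0, dlnrho_dlnr=-2.0)
declining_T = hydrostatic_mass(r_mpc=1.0, kT_kev=5.0, dlnrho_dlnr=-2.0, dlnT_dlnr=-0.3)
print(f"isothermal: {isothermal:.2e} M_sun, declining T: {declining_T:.2e} M_sun")
```

In this toy case, neglecting a temperature slope of $-0.3$ changes the mass at fixed local temperature by 15%, which is why the isothermal assumption matters.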

To estimate the cluster mass when detailed observations of the cluster are unavailable,
we can use a scaling relation between the mass and an observable average quantity.
The most commonly used scaling relations are those involving ICM thermal properties,
such as the X-ray temperature (e.g. [45]; see also [9, 10] for reviews). In this case, however,
the complex thermal structure of the ICM can significantly bias the cluster mass estimate
[50]. Rather than using an X-ray observable, one could use, in principle, the integrated
Sunyaev-Zel'dovich effect, which yields a correlation with mass that is tighter than the
mass-X-ray temperature correlation [40]. However, this correlation is currently valid only
for simulated clusters, and still needs to be confirmed by upcoming cluster surveys.

2.2. Dropping the dynamical equilibrium assumption. - The astrophysical relevance
of galaxies as gravitational lenses was first intuited by Zwicky [82], but it was only
fifty years later that the first gravitational lens effect was measured in a galaxy cluster
[37]. The lensing effect is a distortion of the optical images of sources beyond the mass
concentration; this distortion depends only on the amount of mass along the line of sight
and not on the dynamical state of this mass. The obvious advantage is thus that the
dynamical equilibrium assumption, which is essential for all the methods listed above,
now becomes unnecessary. The lensing effect can be classified as strong or weak lensing,
depending on its intensity. Strong lensing creates multiple images of a single source and
can be used for measuring the cluster mass in its core, where the gravitational potential
is deep enough. In the outer regions, the lensing effect is weaker and it only yields a
tangential distortion of the induced ellipticities of the shape of the background galaxies;
weak lensing can thus measure the depth of the potential well from the center to the
cluster outskirts. The most serious disadvantage of gravitational lensing for measuring
masses is that the signal intensity depends on the relative distances between observer,
lens and source, and clearly not all clusters can be in the appropriate position to
provide lens effects that are easily measurable. Moreover, weak lensing does not generally
have a large signal-to-noise ratio and weak lensing analyses are not trivial (see e.g., [64]).

In 1997, Diaferio and Geller [22] proposed the caustic technique, a novel method to
estimate the cluster mass which is not based on the dynamical equilibrium assumption
and only requires galaxy celestial coordinates and redshifts. The method can thus measure
the cluster mass on all the scales from the central region to well beyond the virial
radius $r_{200}$, the radius within which the average mass density is 200 times the critical
density of the Universe. Prompted by the $N$-body simulations of van Haarlem and van
de Weygaert [76], Diaferio and Geller [22] noticed that in hierarchical models of structure
formation, the velocity field in the regions surrounding the cluster is not perfectly
radial, as expected in the spherical infall model [51, 30], but has a substantial random
component. This fact can be exploited to extract the escape velocity of galaxies from
their distribution in redshift space. Here, we will provide an overview of this method.

2.3. Masses on different scales. - It is clear that weak lensing and the caustic technique
can be applied to scales larger than the virial radius because they do not depend on
the assumption of dynamical equilibrium. However, the other estimators we mentioned
above do not always measure the total cluster mass within $r_{200}$, as, for example, the virial
analyses, based on optical observations, usually do. X-ray estimates rarely go beyond
$\sim 0.5\,r_{200}$, because on these larger scales the X-ray surface brightness becomes smaller
than the X-ray telescope sensitivity; gravitational lensing only measures the central mass
within $\sim 0.1\,r_{200}$, where the strong regime applies. Of course, scaling relations do not
provide any information on the mass profile, but they rather provide the total mass within
a given radius which depends on the scaling relation used: typically, X-ray, optical and
Sunyaev-Zel'dovich scaling relations yield masses within increasing radii, but still smaller
than $r_{200}$.

3. History 

The spherically symmetric infall onto an initial density perturbation is the simplest
classical problem we encounter when we treat the formation of cosmic structure by
gravitational instability in an expanding background [29, 5]. The solution to this problem
provides two relevant results: the density profile of the resulting system and the mean
mass density of the Universe $\Omega_0$.

The former issue has a long history that we do not review here (see, e.g., [81, 20]). The
basic idea is simple: we can imagine a spherical perturbation separated into individual
spherical mass shells that expand to the maximum turn-around radius, the radius where
the radial velocity $v_{\rm pec}(r)$ equals the Hubble velocity, before starting to collapse. This
simple picture enables us to predict the density profile of the final object if we assume that
mass is conserved, there is no shell crossing, and we know the initial density profile of the
perturbation, namely the initial two-point mass correlation function $\xi(r)$; $\xi(r)$ contains
the same information as the power spectrum $P(k)$ of the mass density perturbations,
if these are Gaussian variates. For scale-free initial power spectra $P(k) \propto k^n$, the final
density profile is $\rho \propto r^{-\alpha}$ with $\alpha$ depending on $\Omega_0$ and $n$.

The spherically symmetric infall can also be used to estimate $\Omega_0$. When the average
mass overdensity $\delta(r)$ within the radius $r$ of the perturbation is small enough, we can
compute the radial velocity of each shell of radius $r$ according to linear theory

(4)  $\frac{v_{\rm pec}}{H_0 r} = -\frac{1}{3}\Omega_0^{0.6}\,\delta(r)$ .

In the simplest application of this relation to real systems, we assume that galaxies trace
mass, so that the galaxy number overdensity is simply related to $\delta$; a measure of $v_{\rm pec}$
thus promptly yields $\Omega_0$. In the 1980s this strategy was applied to the Virgo cluster and
the Local Supercluster; galaxies in these systems are close enough that we can measure
galaxy distances $d$ independently of redshift $cz$, and thus estimate the projection along
the line of sight, $v_{\rm pec}^{\rm los} = cz - H_0 d$, of the radial velocity $v_{\rm pec}$. These analyses indicated
that $\Omega_0 = 0.35 \pm 0.15$ [18], in agreement with the most recent estimates [25], but at odds
with the inflationary value $\Omega_0 = 1$, which, at that time, was commonly believed to be
the "correct" value.

A slight complication derives from the fact that the external regions of clusters are
not properly described by linear theory. We can use instead the spherical infall model.
In this case, $\delta$ and $\Omega_0$ are still separable quantities and we can recast equation (4) as

(5)  $\frac{v_{\rm pec}}{H_0 r} = -\frac{1}{3}\Omega_0^{0.6} f(\delta)$ ,

so that we can still measure $\Omega_0$ once $\delta$ is known. Typical approximations are
$f(\delta) = \delta(1+\delta)^{-1/4}$ [79] and $f(\delta) = \delta(1+\delta/3)^{-1/2}$ [77, 16]. A more serious complication is that
departures from spherical symmetry can be large in real systems and the radial velocities
$v_{\rm pec}$ derived from their line-of-sight components can be affected by relative uncertainties
of the order of 50% [77].
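
The following sketch compares the linear prediction with the two non-linear approximations for $f(\delta)$ quoted above. It assumes the relation in the form $v_{\rm pec}/(H_0 r) = -\frac{1}{3}\Omega_0^{0.6} f(\delta)$, with radii in $h^{-1}$ Mpc so that $H_0 r = 100\,r$ km s$^{-1}$; the function names and sample values are hypothetical and chosen only for illustration.

```python
# Infall velocity v_pec = -(1/3) Omega_0^0.6 f(delta) H_0 r for three choices of f.

def f_linear(delta):
    """Linear theory: f(delta) = delta."""
    return delta


def f_ref79(delta):
    """Approximation attributed to [79] in the text: delta (1 + delta)^(-1/4)."""
    return delta * (1.0 + delta) ** -0.25


def f_ref77(delta):
    """Approximation attributed to [77, 16] in the text: delta (1 + delta/3)^(-1/2)."""
    return delta * (1.0 + delta / 3.0) ** -0.5


def infall_velocity(r_hmpc, delta, omega0, f=f_linear):
    """Return v_pec in km/s for r in h^-1 Mpc (H_0 r = 100 r km/s)."""
    hubble_velocity = 100.0 * r_hmpc
    return -hubble_velocity * omega0**0.6 * f(delta) / 3.0


# Hypothetical shell: r = 5 h^-1 Mpc, mean enclosed overdensity delta = 1.
for name, f in (("linear", f_linear), ("[79]", f_ref79), ("[77, 16]", f_ref77)):
    print(name, round(infall_velocity(5.0, 1.0, 0.3, f), 1), "km/s")
```

Both non-linear forms reduce to linear theory for $\delta \ll 1$ and predict slower infall than linear theory once $\delta$ is of order unity.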

The measurement of absolute distances to galaxies remains a difficult problem today. Thus,
estimating $\Omega_0$ from the infall regions of clusters might not be trivial. However, this
complication can be bypassed by the intuition of Regos and Geller [51], who were inspired
by the work of Shectman [67], Kaiser [33] and Ostriker et al. [42]. Kaiser showed that
when observed in redshift space (specifically the line-of-sight velocities of galaxies $cz$
versus their clustrocentric angular distance $\theta$), the infall pattern around a rich cluster
appears as a "trumpet horn" whose amplitude $A(\theta)$ decreases with $\theta$. The turn-around
radius is identified by the condition $A(\theta) = 0$ [42]. For the Abell cluster A539, Ostriker
et al. [42] found the turn-around radius $\theta_{\rm ta} \sim 2^\circ \sim 3\,h^{-1}$ Mpc. Although the galaxy
sampling in the infall region of this cluster was too sparse to measure $\Omega_0$ (they only set a
lower limit $\Omega_0 > 0.03$ with equation 5), the proposed strategy was intriguing, because it
showed that measuring galaxy distances independently of redshift was unnecessary.

Fig. 1. - Caustics (solid lines) according to the spherical infall model (equation 7) in the Coma
cluster (lower panels). The symbols show the galaxy positions in the redshift diagram. Larger
amplitudes correspond to increasing cosmic densities $\Omega_0 = 0.2, 0.5, 1.0$. The mass overdensity $\delta$
is estimated from the galaxy number densities (upper panels) based on CfA data (left panels),
or APM data (right panels). From [75].

Regos and Geller [51] quantified this intuition by showing that the relation between
the galaxy number density $n(r)$ in real space and the galaxy number density $n(cz, \theta)$ in
redshift space is

(6)  $n(cz, \theta) = n(r)\left(\frac{r}{cz}\right)^2 \frac{1}{J}$ ,

where $J$ is the Jacobian of the transformation from real to redshift space coordinates.
When $J = 0$, $n(cz, \theta)$ is infinite. This condition locates the borders of Kaiser's horn,
which are named caustics. We can now use equation (5) to relate $A(\theta)$ to $\Omega_0$ (equation
34 of [51]):

(7)  $A(\theta) \sim \Omega_0^{0.6}\, r f(\delta)$ ,

where $r$ and $\theta$ are related by the transformation from real to redshift space coordinates.

Unfortunately the caustics appeared to be very fuzzy when a sufficiently dense sampling
of the infall region of a rich cluster like Coma was obtained [75]; consequently the
measure of $\Omega_0$ was rather uncertain (Figure 1). This disappointing result was attributed
to the fact that the assumption of spherical symmetry is very poorly satisfied and that
the substructure surrounding the cluster distorts the radial velocity field [76] (Figure 2).
Because the caustic location is so sensitive to the cluster shape, locating the caustics in the
redshift diagram did not appear to be a promising strategy to measure $\Omega_0$.

Fig. 2. - Redshift diagrams of clusters in an $N$-body simulation of the standard Cold Dark
Matter (CDM) model with $\Omega_0 = 1$. The dots show the dark matter particle positions. The left
column shows the redshift diagrams of the same cluster, observed along three different lines of
sight, that has just accreted a group. The right column shows the redshift diagram of another
cluster, observed along three different lines of sight, that has not had substantial mass accretion
in the recent past. In the right column, the caustics according to the spherical infall model are
also shown as solid lines; the smaller (larger) amplitude corresponds to $\Omega_0 = 0.5$ ($\Omega_0 = 1$),
whereas $\Omega_0 = 1$ in the simulation. From [76].

A breakthrough came when Diaferio and Geller [22] took a step further than van
Haarlem and van de Weygaert. In hierarchical clustering scenarios, clusters accrete mass
episodically and anisotropically [14] rather than through the gentle infall of spherical
shells. Moreover, clusters accrete galaxy groups with their own internal motion. Therefore,
the velocity field of the infall region can have substantial non-radial and random
components. These velocity components both make the caustic location fuzzy and, more
importantly, increase the caustic amplitude when compared to the spherical infall model.

Fig. 3. - Caustic amplitude vs. projected clustrocentric distance for a simulated cluster in three
different CDM cosmologies (columns) with $[\Omega_0, \Omega_\Lambda]$ as shown above the upper panels. The
cluster is shown right after a major merger (upper row) and at equilibrium (lower row). The
cosmic time is shown by the scale factor $a$. The crosses show the actual caustic amplitude. The
solid lines show the line-of-sight component of the escape velocity:
$\langle v^2_{\rm esc,los}(r)\rangle^{1/2} = \{-2\phi(r)[1-\beta(r)]/[3-2\beta(r)]\}^{1/2}$.
The dashed lines show the prediction of the spherical infall
model. Clustrocentric distances are in units of the virial radius $r_\delta$. From [22].



This intuition opened the way to interpret the square of the caustic amplitude $A^2(\theta)$
as the average, over the volume $d^3\mathbf{r}$, of the square of the line-of-sight component of
the escape velocity, $\langle v^2_{\rm esc,los}(r)\rangle = -2\phi(r)\,g^{-1}(\beta)$, where $\phi(r)$ is the gravitational
potential profile and $g$ (equation 10) is a function of the velocity anisotropy parameter
$\beta(r)$. The crucial point here is that the equation $A^2(r) = \langle v^2_{\rm esc,los}(r)\rangle$ holds independently
of the dynamical state of the cluster.

This interpretation works amazingly well. Figure 3 shows the results of $N$-body
simulations of the formation and evolution of a galaxy cluster in Cold Dark Matter
(CDM) models with different cosmological parameters. The caustic amplitude (crosses)
and $\langle v^2_{\rm esc,los}(r_\perp)\rangle^{1/2}$ (solid lines), as a function of the projected distance $r_\perp$, agree at all
scales out to ten virial radii $r_\delta$ (^2) and independently of the dynamical state of the cluster:
immediately after a major merger (upper panels) or at equilibrium (lower panels). The
spherical infall model (dashed lines), which should only hold for $r_\perp > r_\delta$, always severely
underestimates the actual caustic amplitude. These simulations and those in [21] also
show another relevant result: the major effect of the cluster shape is not to make the
caustics fuzzy but rather to yield different caustic amplitudes depending on the line of
sight.

(^2) See [22] for the proper definition of the virial radius $r_\delta$ in these plots.

The identification $A^2(r) = \langle v^2_{\rm esc,los}(r)\rangle$ can be immediately used to measure the cluster
mass. If we assume spherical symmetry, the cumulative total mass $M(<r)$ is

(8)  $GM(<r) = r^2\frac{d\phi}{dr} = -\frac{r}{2}\langle v^2_{\rm esc,los}\rangle\, g(\beta)\left(\frac{d\ln\langle v^2_{\rm esc,los}\rangle}{d\ln r} + \frac{d\ln g}{d\ln r}\right)$ .

However, in realistic situations, the two logarithmic derivatives are comparable, and we
thus need to know $\beta(r)$: this is generally not the case. Moreover, the most serious obstacle
in using equation (8) is the fact that sparse sampling and background and foreground
galaxies make the estimate of $\langle v^2_{\rm esc,los}(r)\rangle$ too noisy to extract accurate information from
its differentiation.

To bypass this problem, Diaferio and Geller [22] suggested a different recipe to estimate
the cumulative mass,

(9)  $GM(<r) = \mathcal{F}_\beta \int_0^r A^2(r)\,dr$ ,

where $\mathcal{F}_\beta \approx 0.5$ is a constant. This recipe has been applied to a large number of clusters
ever since and it is now becoming a popular tool to measure the mass in the cluster infall
regions. Below, we justify this recipe and show how it works in practice.

4. The caustic method 

In hierarchical clustering models of structure formation, clusters form by the aggregation
of smaller systems accreting onto the cluster from the surrounding region. The
accretion does not happen purely radially and galaxies within the falling clumps have
velocities with substantial non-radial components. Specifically, these velocities depend
both on the tidal fields of the surrounding region and on the gravitational potential of
the clusters and the groups where the galaxies reside. In the previous section, we have
seen that, when viewed in the redshift diagram, galaxies populate a region with a characteristic
trumpet shape whose amplitude, which decreases with increasing $r$, is related
to the escape velocity from the cluster region.

The escape velocity $v^2_{\rm esc}(r) = -2\phi(r)$, where $\phi(r)$ is the gravitational potential
originated by the cluster, is a non-increasing function of $r$, because gravity is always attractive
and $d\phi/dr > 0$. Thus, we can identify the square of the amplitude $A$ at the projected
radius $r_\perp$ as the average of the square of the line-of-sight component $\langle v^2_{\rm los}\rangle$ of the escape
velocity at the three-dimensional radius $r = r_\perp$. To relate $\langle v^2_{\rm los}\rangle$ to $\phi(r)$, we need the
velocity anisotropy parameter $\beta(r)$ (equation 2). If the cluster rotation is negligible, we
have $\langle v_\theta^2\rangle = \langle v_\phi^2\rangle = \langle v^2_{\rm los}\rangle$, and $\langle v_r^2\rangle = \langle v^2\rangle - 2\langle v^2_{\rm los}\rangle$. By substituting this relation into
equation (2), we obtain $\langle v^2\rangle = \langle v^2_{\rm los}\rangle g(\beta)$ where

(10)  $g(\beta) = \frac{3 - 2\beta(r)}{1 - \beta(r)}$ .

By applying this relation to the escape velocity at radius $r$, $\langle v^2_{\rm esc}(r)\rangle = -2\phi(r)$, and
by assuming that $A^2(r) = \langle v^2_{\rm esc,los}\rangle$, we obtain the fundamental relation between the
gravitational potential $\phi(r)$ and the observable caustic amplitude $A(r)$

(11)  $-2\phi(r) = A^2(r)\,g(\beta)$ .
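
As a small numerical check of this relation, the sketch below evaluates $g(\beta) = (3-2\beta)/(1-\beta)$, the combination that follows from substituting $\langle v_r^2\rangle = \langle v^2\rangle - 2\langle v^2_{\rm los}\rangle$ into equation (2) and that also appears in the caption of Figure 3, and then converts a caustic amplitude into a potential through $-2\phi = A^2 g(\beta)$. The function names and the sample amplitude are assumptions introduced for illustration.

```python
# g(beta) and the potential implied by a caustic amplitude, -2 phi = A^2 g(beta).

def g(beta):
    """Return (3 - 2 beta) / (1 - beta); beta = 0 is the isotropic case."""
    if beta >= 1.0:
        raise ValueError("beta must be smaller than 1")
    return (3.0 - 2.0 * beta) / (1.0 - beta)


def potential_from_amplitude(amplitude_kms, beta=0.0):
    """Return phi in (km/s)^2 for a caustic amplitude in km/s."""
    return -0.5 * amplitude_kms**2 * g(beta)


# Hypothetical amplitude of 1000 km/s for tangential, isotropic and radial orbits.
for beta in (-1.0, 0.0, 0.5):
    print(beta, g(beta), potential_from_amplitude(1000.0, beta))
```

For isotropic orbits $g = 3$, so the mean square velocity is three times its line-of-sight component, as expected; more radial orbits (larger $\beta$) imply a deeper potential for the same observed amplitude.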

To infer the cluster mass to very large radii, one first notices that the mass of a
shell of infinitesimal thickness $dr$ can be cast in the form
$G\,dm = -2\phi(r)\mathcal{F}(r)\,dr = A^2(r)g(\beta)\mathcal{F}(r)\,dr$, where

(12)  $\mathcal{F}(r) = -2\pi G\frac{\rho(r) r^2}{\phi(r)}$ .

Therefore the mass profile is

(13)  $GM(<r) = \int_0^r A^2(r)\mathcal{F}_\beta(r)\,dr$ ,

where $\mathcal{F}_\beta(r) = \mathcal{F}(r)g(\beta)$.

Equation (13), however, only relates the mass profile to the density profile of a spherical
system, and one profile cannot be inferred without knowing the other. We can solve this
impasse by noticing that, in hierarchical clustering scenarios, $\mathcal{F}(r)$ is not a strong function
of $r$ [22]. This is easily seen in the case of the Navarro, Frenk and White (NFW) [41]
mass density profile, which is an excellent description of the dark matter distribution in
these models:

(14)  $\mathcal{F}_{\rm NFW}(r) = \frac{r^2}{2(r+r_s)^2}\frac{1}{\ln(1+r/r_s)}$ ,

where $r_s$ is a scale-length parameter. If clusters form through hierarchical clustering,
$\mathcal{F}_\beta(r)$ is also a slowly changing function of $r$ [22, 21]. We can then assume, somewhat
strongly, that $\mathcal{F}_\beta(r) = \mathcal{F}_\beta = {\rm const}$ altogether and adopt the recipe

(15)  $GM(<r) = \mathcal{F}_\beta \int_0^r A^2(r)\,dr$ .

When $\mathcal{F}_\beta = 1/2$, this recipe proves to yield mass profiles accurate to 50% or better both
in $N$-body simulations and in real clusters, when compared with masses obtained with
standard methods, namely Jeans equation, X-ray and gravitational lensing, applied on
scales where the validities of these methods overlap [23].
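
To see how slowly the NFW filling function varies, the sketch below tabulates $\mathcal{F}_{\rm NFW} = x^2/[2(1+x)^2\ln(1+x)]$ with $x = r/r_s$, together with the product $\mathcal{F}_{\rm NFW}\,g(\beta)$ for isotropic orbits. The isotropic value $g = 3$ and the sampled radii are assumptions made only for this illustration.

```python
import math


def filling_nfw(x):
    """NFW filling function as a function of x = r / r_s."""
    if x <= 0:
        raise ValueError("x = r / r_s must be positive")
    return x**2 / (2.0 * (1.0 + x) ** 2 * math.log(1.0 + x))


G_ISOTROPIC = 3.0  # assumed g(beta) for beta = 0

for x in (1.0, 2.0, 3.0, 5.0, 10.0):
    f = filling_nfw(x)
    print(f"r/r_s = {x:4.1f}  F_NFW = {f:.3f}  F_NFW * g = {G_ISOTROPIC * f:.3f}")
```

Between $r_s$ and $10\,r_s$ the function stays within about 0.17 to 0.21, so with isotropic orbits the product remains between roughly 0.5 and 0.6, in line with the constant value $1/2$ adopted in the recipe. At radii well inside $r_s$ the function drops towards zero, so the constancy is a statement about the outer regions.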

It is appropriate to emphasize that equations (11) and (13) are rigorously correct, 
whereas equation (15) is a heuristic recipe for the estimation of the mass profile. 
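
A minimal sketch of the heuristic recipe $GM(<r) = \mathcal{F}_\beta\int_0^r A^2(r)\,dr$ with $\mathcal{F}_\beta = 1/2$ is given below, using the trapezoidal rule on a tabulated amplitude profile. The function name, the units and the flat toy amplitude profile are assumptions for illustration; real applications also need the caustic location and error estimates, which are not covered here.

```python
# Cumulative caustic mass from a tabulated amplitude profile (trapezoidal rule).
G = 4.30091e-9  # gravitational constant in Mpc (km/s)^2 / M_sun


def caustic_mass_profile(radii_mpc, amplitudes_kms, f_beta=0.5):
    """Return M(<r_k) in solar masses at each tabulated radius.

    radii_mpc must be strictly increasing and start at 0, so that the
    integral of A^2 runs from the cluster centre.
    """
    if len(radii_mpc) != len(amplitudes_kms):
        raise ValueError("radii and amplitudes must have the same length")
    if len(radii_mpc) == 0 or radii_mpc[0] != 0.0:
        raise ValueError("the radial grid must start at r = 0")
    masses = [0.0]
    integral = 0.0  # running value of the integral of A^2 dr
    for k in range(1, len(radii_mpc)):
        dr = radii_mpc[k] - radii_mpc[k - 1]
        if dr <= 0:
            raise ValueError("radii must be strictly increasing")
        integral += 0.5 * (amplitudes_kms[k] ** 2 + amplitudes_kms[k - 1] ** 2) * dr
        masses.append(f_beta * integral / G)
    return masses


# Toy profile: constant amplitude of 1000 km/s out to 2 Mpc.
radii = [0.0, 0.5, 1.0, 1.5, 2.0]
amplitudes = [1000.0] * len(radii)
profile = caustic_mass_profile(radii, amplitudes)
print([f"{m:.2e}" for m in profile])
```

Because the recipe integrates $A^2$ rather than differentiating it, noise in the amplitude profile is averaged rather than amplified, which is the practical advantage over a derivative-based estimate.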

Fig. 4. - Velocity dispersion of the galaxies on the main branch of the binary tree of three real
clusters while walking towards the leaves (see [23]). There is an obvious plateau when entering
the tree sector with the cluster members. The filled dots indicate the chosen $\sigma$ used to cut the
binary tree and thus select the cluster members.



4.1. Implementation. - The implementation of the caustic method requires: (1) the
determination of the cluster center; (2) the estimate of the galaxy distribution in the
redshift diagram; (3) the location of the caustics.

To estimate the cluster center, the galaxies in the cluster field of view (^3) are arranged
in a binary tree according to the pairwise projected energy

(16)  $E_{ij} = -G\frac{m_i m_j}{R_p} + \frac{1}{2}\frac{m_i m_j}{m_i + m_j}\Pi^2$ ,

where $R_p$ and $\Pi$ are the projected spatial separation and the proper line-of-sight velocity
difference of each galaxy pair, respectively; $m_i$ and $m_j$ are the galaxy masses, which are
usually set constant, but can also be chosen according to the galaxy luminosities.

By walking along the main branch of the tree from the root to the leaves, we progressively
remove the background and foreground galaxies. We identify the cluster members
by computing the velocity dispersion $\sigma$ of the galaxies still on the main branch at each
step: $\sigma$ remains roughly constant when we move through the binary tree sector which only
contains the cluster members (Figure 4), because the cluster is approximately isothermal.
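
The pairwise projected energy that orders the binary tree, $E_{ij} = -G m_i m_j/R_p + \frac{1}{2}[m_i m_j/(m_i+m_j)]\Pi^2$, can be sketched as follows. The constant mass of $10^{12}\,M_\odot$ per galaxy, the three hypothetical pairs and the idea of reading off the most bound pair are assumptions for illustration only; how $R_p$ and $\Pi$ are computed from coordinates and redshifts, and how the tree is actually built, are described in [21, 66].

```python
# Pairwise projected energy for galaxy pairs, in M_sun (km/s)^2.
G = 4.30091e-9  # gravitational constant in Mpc (km/s)^2 / M_sun


def pairwise_energy(r_p_mpc, pi_kms, m_i=1e12, m_j=1e12):
    """Return E_ij for projected separation R_p (Mpc) and velocity difference Pi (km/s).

    The default masses of 1e12 M_sun are an assumed constant value.
    """
    if r_p_mpc <= 0:
        raise ValueError("the projected separation must be positive")
    potential = -G * m_i * m_j / r_p_mpc
    kinetic = 0.5 * m_i * m_j / (m_i + m_j) * pi_kms**2
    return potential + kinetic


# Hypothetical pairs: (R_p in Mpc, Pi in km/s).
pairs = {"A-B": (0.1, 100.0), "A-C": (0.5, 150.0), "B-C": (2.0, 1500.0)}
energies = {name: pairwise_energy(*sep) for name, sep in pairs.items()}
most_bound = min(energies, key=energies.get)
print(energies, "most bound pair:", most_bound)
```

Pairs that are close on the sky and in velocity have strongly negative energies, while pairs with a large line-of-sight velocity difference, typical of foreground and background galaxies, have positive energies and end up far from the cluster members in the tree.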

(^3) We clarify that to apply the caustic technique we already need to know that there is a
cluster in the field of view. The caustic technique, as it is currently conceived, is not a method
to identify clusters in redshift surveys, as the Voronoi tessellation [48] or the matched filter [47].

Fig. 5. - Median mass profiles, measured with the caustic technique, of dark matter halos in
samples extracted from CDM models. The cosmological parameters $[\Omega_0, \Omega_\Lambda]$ are shown above
the upper panels. The upper row shows the most massive halos: $M(<r_\delta) > 10^{14} M_\odot$ for the
high-density model, and $M(<r_\delta) > 2 \cdot 10^{13} M_\odot$ for the low-density models. The lower row
shows the least massive halos: $10^{13} M_\odot < M(<r_\delta) < 10^{14} M_\odot$ for the high-density model, and
$10^{12} M_\odot < M(<r_\delta) < 2 \cdot 10^{13} M_\odot$ for the low-density models. The numbers of halos in each
sample are indicated in each panel. The error bars indicate upper and lower quartiles at each
projected distance $r_\perp$. From [22].

The cluster members provide the cluster center and therefore the redshift diagram
$(r, v)$. The galaxy distribution $f_q(r, v)$ on this plane is estimated with an adaptive kernel
method. At each projected radius $r$, the function $\varphi(r) = \int f_q(r, v)\,dv$ provides the mean
escape velocity $\langle v^2_{\rm esc}\rangle_{\kappa,R} = \int_0^R A^2_\kappa(r)\varphi(r)\,dr / \int_0^R \varphi(r)\,dr$, where $A_\kappa$ is the amplitude of
the caustics located by the equation $f_q(r, v) = \kappa$. The appropriate $\kappa$ is the root of the
equation $\langle v^2_{\rm esc}\rangle_{\kappa,R} = 4\sigma^2$, where $\sigma$ is the velocity dispersion of the members identified on
the binary tree. Further technical details of this implementation are described in [21, 66].
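
The threshold condition $\langle v^2_{\rm esc}\rangle_{\kappa,R} = 4\sigma^2$ is a one-dimensional root-finding problem in $\kappa$. The sketch below solves it by bisection for a deliberately simple toy model, $f_q(r, v) = \exp[-v^2/(2s^2)]$ with constant $s$, for which the contour $f_q = \kappa$ gives $A_\kappa^2 = -2s^2\ln\kappa$ at every radius. The function names, the toy distribution and the bracketing interval are assumptions for illustration; this is not the adaptive-kernel implementation of [21, 66].

```python
import math


def solve_kappa(mean_esc2, sigma_kms, lo, hi, tol=1e-10, max_iter=200):
    """Bisection for the root of mean_esc2(kappa) = 4 sigma^2 on [lo, hi]."""
    target = 4.0 * sigma_kms**2
    f_lo = mean_esc2(lo) - target
    f_hi = mean_esc2(hi) - target
    if f_lo * f_hi > 0:
        raise ValueError("the root is not bracketed by [lo, hi]")
    for _ in range(max_iter):
        mid = 0.5 * (lo + hi)
        f_mid = mean_esc2(mid) - target
        if f_lo * f_mid <= 0:
            hi, f_hi = mid, f_mid  # root lies in [lo, mid]
        else:
            lo, f_lo = mid, f_mid  # root lies in [mid, hi]
        if hi - lo < tol:
            break
    return 0.5 * (lo + hi)


S = 800.0  # toy velocity scale in km/s, also used as the member dispersion


def toy_mean_esc2(kappa):
    """Mean squared amplitude for the toy f_q; independent of radius by construction."""
    return -2.0 * S**2 * math.log(kappa)


kappa = solve_kappa(toy_mean_esc2, sigma_kms=S, lo=1e-6, hi=1.0)
print(f"kappa = {kappa:.6f}")
```

In this toy case the root can be checked analytically, $\kappa = e^{-2}$, because the condition reduces to $-2\ln\kappa = 4$. With a real kernel density estimate the function of $\kappa$ has no closed form, but the same bracketing approach applies as long as the mean amplitude decreases monotonically with $\kappa$.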

5. Reliability of the method 

5.1. Comparison with simulations. - The caustic technique was tested on $N$-body simulations
of cluster formation in CDM cosmologies. Dark-matter-only simulations showed
that the caustic amplitude and the escape velocity profiles agree amazingly well out to
ten virial radii, independently of the cosmological parameters and, more importantly,
of the dynamical state of the cluster (Figure 3). These simulations also showed that
the technique works on both massive and less massive clusters (Figure 5). In the latter
case, the scatter is larger because of projection effects and sparse sampling. In the most
massive systems, the mass is recovered within 20% out to ten virial radii in most cases.

To test the implementation of the caustic method in realistic cases, we can use $N$-body
simulations where the galaxies are formed and evolved with a semi-analytic technique
[34]. Figure 6 shows the mass profile of a single cluster observed along ten different lines
of sight in such simulations [21]. When comparing this figure with Figures 3 and 5, where
all the dark matter particles were observed, we see that the caustic technique performs in
this latter case better than when only the galaxies are available. This difference clearly
originates from the sparser sampling of the velocity field provided by the galaxies. Figure
6 also shows that projection effects cause the most relevant systematic error. However,
the uncertainty on the mass profile remains smaller than 50% out to $8\,h^{-1}$ Mpc from the
cluster center.

Fig. 6. - Radial profiles of the caustic amplitude (upper row), cumulative mass (middle row),
and mass-to-light ratio (lower row) of a simulated cluster observed along ten different lines of
sight. The thin lines are the profiles estimated from the individual redshift diagrams. The thick
lines are the real profiles. In the lower panels, the solid line is the mean mass-to-light ratio
of the simulated universe. Left and right columns are for a cluster in a $\Lambda$CDM and a $\tau$CDM
model, respectively. From [21].

5.2. Caustic vs. lensing. - In equation (15) the choice of the constant filling factor
is based on $N$-body simulations alone. Therefore, it is not guaranteed that the caustic
technique can recover the mass profile of real clusters if the simulations are not a realistic
representation of the large-scale mass distribution in the Universe.

Other than the caustic technique, the only method for estimating the mass in the
outer regions of galaxy clusters is based on weak lensing. The comparison between these
two methods was performed on the clusters A2390, MS1358 and Cl 0024, which are at
the appropriate redshift to have a reasonably intense lens signal and a sufficiently high
number of galaxy redshifts [23]. Figure 7 shows the redshift diagrams and the mass
profiles of these systems. Caustic and lensing masses agree amazingly well. The most
impressive result is for Cl 0024. This cluster is likely to have experienced a recent merging
event [17], and it probably is out of equilibrium: in this system the caustic mass and the
lensing mass agree with each other, but disagree with the X-ray mass, which is the only
estimate relying on dynamical equilibrium. This result therefore proves the reliability of
the caustic technique and its independence of the dynamical state of the system in real
clusters.

Fig. 7. - Comparison between caustic, lensing, and X-ray mass estimates. The left, middle and
right columns are for A2390, MS1358 and Cl 0024, respectively. Top panels: Redshift diagrams
with the galaxies (dots) and caustic locations (solid lines). Line-of-sight velocities $v$ are in
the cluster rest-frame. Middle panels: Three-dimensional cumulative mass profiles. The solid
squares show the caustic mass estimates; the solid lines are the best-fitting NFW profiles to the
data points within $1\,h^{-1}$ Mpc; the dotted lines are the best-fitting NFW profiles to the X-ray
measures (from left to right: [2, 3, 43]); the dashed lines are the best-fitting isothermal (A2390,
[69]; MS1358, [31]) or NFW models (Cl 0024, [35]) to the gravitational lensing measures. The
left and right vertical dotted lines show the radius of the X-ray and gravitational lensing fields
of view, respectively. The two filled circles show the virial estimates of A2390 and MS1358 [12].
Bottom panels: Projected cumulative mass profiles; lines are as in the middle panels. The open
diamonds show the weak lensing measures: A2390, [69]; MS1358, lower limit to the mass profile
[31]. Filled diamonds show the strong lensing measures: A2390, [46]; MS1358, [1, 26]; Cl 0024,
upper symbol, [73], lower symbol, [11]. Error bars in all panels are 1-$\sigma$; error bars on points
where they seem to be missing are smaller than the symbol size. From [23].

6. Application to real systems 

6.1. Mass profiles. - Geller et al. [27] were the first to apply the caustic method to a 
real cluster: they measured the mass profile of Coma out to $10\,h^{-1}$ Mpc from the cluster 
Fig. 8. - Top panels: Galaxy distribution in the redshift diagram of Coma for three galaxy 
samples of increasing size. There are 332, 480, and 691 galaxies within the caustics in the 
samples L4.25, L10.0, and C10.0, respectively. Note that these samples are not substantially 
larger than the samples in Figure 1 used to estimate $\Omega_0$ with the spherical infall model. The 
bold lines indicate the location of the caustics. Half the distance between the caustics defines the 
amplitude $\mathcal{A}(r)$ shown in the middle panels. Bottom panels: The bold lines are the caustic 
mass profiles. The two error bars show the range of the X-ray mass estimates listed in [32]. 
Short-dashed and long-dashed lines are the cumulative mass profile for a softened isothermal 
sphere and an NFW density profile with parameters obtained by fitting the mass profile in the 
range $[0, 1]\,h^{-1}$ Mpc. Shaded areas in the middle and bottom panels indicate the 2-$\sigma$ uncertainty. 
From [23]. 



center and were able to demonstrate that the NFW profile fits the cluster density profile 
out to these very large radii, thus ruling out the isothermal sphere as a viable model of 
the cluster mass distribution (Figure 8). A few years later, the failure of the isothermal 
model was confirmed by the first similar analyses based on gravitational lensing applied 
to A1689 [13, 36] and Cl 0024 [35]. The goodness of the NFW fit out to $5 - 10\,h^{-1}$ Mpc 
was confirmed by applying the caustic technique to a sample of nine clusters densely 
sampled in their outer regions, the Cluster And Infall Region Nearby Survey (CAIRNS, 
[61]), and, more recently, to a complete sample of 72 X-ray selected clusters with galaxy 
redshifts extracted from the Fourth Data Release of the Sloan Digital Sky Survey (Cluster 
Infall Regions in the Sloan Digital Sky Survey: CIRS, [54]). 
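
To make the comparison between the NFW and isothermal models concrete, the following Python sketch evaluates both cumulative mass profiles in scaled units, $M(<r)/M_{200}$ as a function of $r/r_{200}$. It assumes the standard analytic NFW form, in which the enclosed mass is proportional to $\ln(1+cx) - cx/(1+cx)$ with $x = r/r_{200}$, and a singular isothermal sphere whose mass grows linearly with radius. The function names are illustrative and are not taken from the analyses cited above.

```python
import math

def nfw_shape(y):
    """Dimensionless NFW enclosed-mass function m(y) = ln(1 + y) - y / (1 + y)."""
    return math.log1p(y) - y / (1.0 + y)

def nfw_mass_ratio(x, c):
    """M(<r)/M_200 for an NFW halo, with x = r/r_200 and concentration c = r_200/r_s."""
    return nfw_shape(c * x) / nfw_shape(c)

def sis_mass_ratio(x):
    """M(<r)/M_200 for a singular isothermal sphere: mass grows linearly with radius."""
    return x

if __name__ == "__main__":
    # Compare the two models at a few multiples of r_200.
    for x in (0.5, 1.0, 2.0, 3.0):
        row = ", ".join(f"c={c}: {nfw_mass_ratio(x, c):.2f}" for c in (3, 5, 10))
        print(f"r/r_200 = {x:.1f}  NFW [{row}]  SIS: {sis_mass_ratio(x):.2f}")
```

Beyond $r_{200}$ the NFW curves rise more slowly than the isothermal line, and at a fixed multiple of $r_{200}$ a lower concentration gives a larger enclosed mass.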

CIRS is currently the largest sample of clusters whose mass profiles have been measured 
out to $\lesssim 3 r_{200}$ (Figure 9); Rines and Diaferio [54] were thus able to obtain a 
statistically significant estimate of the ratio between the mass within the turn-around 
radius $M_t$ and the virial mass $M_{200}$: they found an average value of $M_t/M_{200} = 2.2 \pm 0.2$, 
Fig. 9. - Scaled caustic mass profiles for the CIRS clusters. The thin solid lines show the 
caustic mass profiles normalized by $r_{200}$ and $M_{200}$, the total mass within $r_{200}$. The long-dashed 
line shows a singular isothermal sphere, the solid lines show NFW profiles (with concentrations 
$c = 3, 5, 10$ from top to bottom at large radii). The short-dashed lines are Hernquist profiles 
with scale radii different by a factor of two. From [54]. 



which is ~ 50% smaller than the value expected in current models of cluster formation 
[71]. The caustic technique is not limited to clusters, but, when enough redshifts are 
available, it can also be applied to groups of galaxies: on a sample of 16 groups both the 
NFW mass profiles and the ratio $M_t/M_{200} = 2.3 \pm 0.4$ are confirmed [53]. 

Rines et al. [56, 55] also used the CIRS sample to estimate the virial mass function 
of nearby clusters and determined cosmological parameters consistent with the WMAP 
values [25]; they also showed that velocity bias is absent in real clusters. 

A good fit with the NFW profile out to $\sim 2 r_{200}$ was also found by Biviano and Girardi 
[8] who applied the caustic technique to an ensemble cluster obtained by stacking 43 
clusters from the Two Degree Field Galaxy Redshift Survey (2dFGRS, [15]): here, unlike the 
previous analyses, the caustic method was not applied to individual clusters, because the 
number of galaxies per cluster was relatively small. 

The caustic method does not rely on the dynamical state of the cluster and its external 
regions: mass estimates therefore exist for unrelaxed systems, including the Shapley 
supercluster [52], the poor Fornax cluster, which contains two distinct dynamical 
components [24], and the A2199 complex [58]. 

6.2. Mass-to-light profiles. - By combining accurate photometry with the caustic mass 
of A576, Rines et al. [59] were able to measure, for the first time, the profile of the 
mass-to-light ratio M/L well beyond the cluster virial radius: they found an R-band M/L 
profile steadily decreasing from ~ 0.5 to $4\,h^{-1}$ Mpc, indicating that, in this cluster, dark 
matter is more concentrated than galaxies. Slightly decreasing M/L profiles were also 
measured in the outer region of five (including A576) out of the nine CAIRNS clusters in 
the K-band [57]. The remaining CAIRNS clusters have an M/L profile which remains 
roughly flat at radii larger than $\sim 1\,h^{-1}$ Mpc. Coma shows a remarkably flat K-band 
M/L profile out to $10\,h^{-1}$ Mpc [62]. A flat M/L profile beyond $\sim 0.5\,h^{-1}$ Mpc was also 
found in A1644 in the H-band [72]. 

These results are due to two reasons: (1) a larger predominance of less luminous late- 
type galaxies in the cluster outer regions; and (2) the fact that the K-band M/L ratio of 
real galaxy systems increases with the system mass [49]. In fact, clusters form by accre- 
tion of smaller systems, as indicated for example by the optical and X-ray observations 
of the A2199 complex [63], and as expected in current hierarchical models of structure 
formation [68]; therefore, the cluster surrounding regions, which mostly contain galaxy 
groups, should naturally have smaller M/L. The positive M/L-mass correlation was 
also obtained in semi-analytical models of galaxy formation [34] and is well described by 
the statistical technique based on the conditional luminosity function [74]. 

The infall regions are the transition between the dense cluster regions and the field 
[4, 44], and the internal properties of galaxies do not vary abruptly at the virial radius 
[60]. Therefore galaxy surveys in the outskirts of clusters, such as those mentioned above, can 
clearly constrain models of cluster and galaxy formation. 

7. Conclusion and perspectives 

The caustic method and gravitational lensing are the only two techniques currently 
available for measuring the mass profile of clusters beyond their virial radius. The caustic 
method requires a sufficiently dense redshift survey with a large field of view and is 
only limited by the time needed to measure a large enough number of galaxy spectra; 
this observing time increases quickly with cluster redshift. On the other hand, lensing 
requires wide-field photometric surveys that need high angular resolution and extremely 
good observing conditions; moreover, the lensing signal is strong enough only when the 
cluster is within a limited redshift range $z \approx 0.1 - 1$. 

When the caustic technique was proposed, multi-object spectroscopy was not routinely 
applied to measure galaxy redshifts, and the requirement of 100 or more redshifts in 
the outskirts of clusters appeared demanding. Nowadays this task can be accomplished 
more easily and the popularity of the caustic technique has begun to increase. 

The caustic technique has been tested on $N$-body simulations and the mass profiles 
are accurate to better than ~ 50% out to $\sim 3 - 4\,r_{200}$. On the three systems where both 
the caustic method and lensing could be applied, the two methods yield consistent mass 
profiles. This consistency also holds in Cl 0024 whose X-ray mass profile disagrees with 
the caustic and lensing profiles; this disagreement is most likely due to the fact that this 
cluster is out of equilibrium and thus the X-ray mass is unreliable. 
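
For orientation, the caustic mass profiles discussed here follow from the caustic amplitude through the relation $G M(<r) = F_\beta \int_0^r \mathcal{A}^2(x)\,dx$. The Python sketch below applies this relation with a simple trapezoidal sum. The constant filling factor $F_\beta = 1/2$ is an assumption (a value commonly adopted for this method), the function name and the toy amplitude are hypothetical, and radii are taken in Mpc, so radii given in $h^{-1}$ Mpc yield masses in $h^{-1} M_\odot$.

```python
# Gravitational constant in Mpc (km/s)^2 / M_sun.
G_MPC = 4.30091e-9

def caustic_mass_profile(radii_mpc, amplitude_kms, f_beta=0.5):
    """Cumulative mass (M_sun) at each radius from G M(<r) = F_beta * integral of A^2 dr.

    radii_mpc     : increasing radii in Mpc, starting at the innermost radius
    amplitude_kms : caustic amplitude A(r) in km/s at those radii
    f_beta        : constant filling factor (0.5 is an assumed value)
    """
    if len(radii_mpc) != len(amplitude_kms):
        raise ValueError("radii and amplitudes must have the same length")
    masses = [0.0]
    integral = 0.0
    for i in range(1, len(radii_mpc)):
        dr = radii_mpc[i] - radii_mpc[i - 1]
        mean_a2 = 0.5 * (amplitude_kms[i] ** 2 + amplitude_kms[i - 1] ** 2)
        integral += mean_a2 * dr  # trapezoidal rule on A^2(r)
        masses.append(f_beta * integral / G_MPC)
    return masses

if __name__ == "__main__":
    # Toy input: a constant amplitude of 1000 km/s out to 4 Mpc.
    radii = [0.0, 1.0, 2.0, 3.0, 4.0]
    amplitude = [1000.0] * len(radii)
    for r, m in zip(radii, caustic_mass_profile(radii, amplitude)):
        print(f"r = {r:.1f} Mpc  M(<r) = {m:.3e} M_sun")
```

Because the mass depends on the integral of $\mathcal{A}^2$, an error in the amplitude at one radius propagates to the mass at all larger radii.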

The uncertainties on the caustic mass profile are almost totally due to projection 
effects. In fact, the method assumes that the cluster is spherically symmetric, and this is 
rarely the case; therefore the redshift diagram from which the caustic mass is extracted 
can vary substantially when the cluster is observed along different lines of sight. The 
size of this systematic error is comparable to the systematic uncertainty we have with 
lensing methods which measure all the mass projected along the line of sight. 

What the caustic technique actually measures is the line-of-sight component of the 
escape velocity from the cluster (equation 11). If we can measure the velocity anisotropy 
parameter $\beta$, the caustic technique thus yields a direct measure of the profile of the 
cluster gravitational potential. 
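
The step from the caustic amplitude to the potential can be sketched as follows, assuming that equation 11 has the usual form of the caustic method, $\langle v^2_{\rm esc,los}\rangle = \mathcal{A}^2(r) = -2\phi(r)/g(\beta)$ with $g(\beta) = (3 - 2\beta)/(1 - \beta)$. The Python function names are illustrative and the input values are made up for the example.

```python
def g_beta(beta):
    """Anisotropy factor g(beta) = (3 - 2*beta) / (1 - beta); requires beta < 1."""
    if beta >= 1.0:
        raise ValueError("beta must be smaller than 1")
    return (3.0 - 2.0 * beta) / (1.0 - beta)

def potential_from_amplitude(amplitude_kms, beta):
    """Gravitational potential phi(r) in (km/s)^2 from A^2 = -2 phi / g(beta)."""
    return -0.5 * g_beta(beta) * amplitude_kms ** 2

if __name__ == "__main__":
    # Hypothetical caustic amplitude of 1500 km/s at some radius.
    for beta in (0.0, 0.3, 0.5):
        phi = potential_from_amplitude(1500.0, beta)
        print(f"beta = {beta:.1f}  g = {g_beta(beta):.2f}  phi = {phi:.3e} (km/s)^2")
```

For isotropic orbits ($\beta = 0$) the factor is $g = 3$, so under this assumed form the potential is $-(3/2)\mathcal{A}^2$; more radial orbits (larger $\beta$) imply a deeper potential for the same amplitude.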

This brief review shows that the caustic technique is a powerful tool for the analysis 
of clusters and their external regions, but its full potential still needs to be exploited. 
For example, the $\sigma$ plateau, which appears when walking along the binary tree (Figure 4), 
provides a clean way to identify the cluster members. This issue still needs a thorough 
investigation [66], but very preliminary results, based on a large sample of synthetic 
clusters, show that ~ 90% of the galaxies within the caustics are cluster members and 
that the interloper contamination is comparable to or lower than that of other methods [78]. An 
additional byproduct of the caustic machinery is the identification of cluster substructures 
from the distribution of the galaxies in the binary tree [65]. This topic is also 
currently under investigation [66]. 